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Abstract 

This paper is first devoted to study an adaptive wavelet based estimator of the long memory parameter 
for linear processes in a general semi-parametric frame. This is an extension of Bardet et al. (2008) which 
only concerned Gaussian processes. Moreover, the definition of the long memory parameter estimator is 
modified and asymptotic results are improved even in the Gaussian case. Finally an adaptive goodness- 
of-fit test is also built and easy to be employed: it is a chi-square type test. Simulations confirm the 
^ , interesting properties of consistency and robustness of the adaptive estimator and test. 

o 

^ ', 1 Introduction 

\ The long-memory processes are now a subject area well studied and often applied (see for instance the book 
edited by Doukhan et al, 2003). The most famous long- memory stationary time series are the fractional Gaus- 
sian noises (fGn) with Hurst parameter H and FARIMA(p, d, q) processes. For both these time series, the 
spectral density / in follows a power law: /(A) ~ C A"^'' where 77 = d -I- 1/2 in the case of the fGn. This 
behavior of the spectral density is generally a definition adopted for a stationary long memory (or long range 
• ^ . dependent) process even if this definition requires the existence of a second order moment. 
/\ ' There are a lot of statistical results relative to the estimation of the long memory parameter d. First and 
^ . main results in this direction were obtained for parametric models with the essential papers of Fox and Taqqu 
■ ■ ' (1986) and Dahlhaus (1989) for Gaussian time scries, Giraitis and Surgailis (1990) for linear processes and 
Giraitis and Taqqu (1999) for non linear functions of Gaussian processes. 

However and especially for numerical applications, parametric estimators are not really robust and can in- 
duce no consistent estimations. Thus, the research is now rather focused on semiparametric estimators of 
the d. Different approaches were considered: the famous and seminal R/S statistic (see Hurst, 1951), the 
log-periodogram estimator (see Moulines and Soulier, 2003), the local Whittle estimator (see Robinson, 1995) 
or the wavelet based estimator (see Veitch et al, 2003, Moulines et al, 2007 or Bardet et al, 2008). All these 
estimators require the choice of an auxiliary parameter (frequency bandwidth, scales,...) but adaptive versions 
of original estimators are generally built for avoiding this choice. In a general semiparametric frame, Giraitis et 
al (1997) obtained the asymptotic lower boimd for the minimax risk of estimating d, expressed as a hmction of 
the second order parameter of the spectral density expansion around 0. Thus, several adaptive semiparametric 
arc proved to follow an oracle property up to multiplicative logarithm term. But simulations (see for instance 
Bardet et al, 2003 or 2008) show that the most accurate estimators arc local Whittle, global log-periodogram 
or wavelet based estimators. 

The use of a wavelet based estimator for estimating d was first proposed in Abry et al. (1998) after pre- 
liminary studies devoted to selfsimilar processes. Then Bardet et al. (2000) provided proofs of the consistency 
of such an estimator in a Gaussian semiparametric frame. Moulines et al. (2007) improved these results, 
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proved a central limit theorem for the estimator of d and showed that this estimator is rate optimal for the 
minimax criterion. Finally, Roueff and Taqqu (2009a) established similar results in a semiparametric frame for 
linear processes. All these papers were obtained using a wavelet analysis based on a discrete multi-resolution 
wavelet transform, which notably allows to compute the wavelet coefficients with the fast Mallat's algorithm. 
However, there remains a gap in these papers: in a semiparametric frame the "optimal" scale used for the 
wavelet analysis is depending on the second order expansion of the spectral density around frequency and 
these papers consider that the power of the second order expansion is known while this is unknown in practice. 
Two papers proposed a method for automatically selecting this "optimal" scale in the Gaussian semiparametric 
frame. Firstly, Veitch et al. (2003) using a kind of Chi-square test which provides convincing numerical results 
but the consistency of this procedure is not established. Secondly, Bardet et al. (2008) proved the consistency 
of a procedure for choosing optimal scales based on the detection of the "most linear part" of the log-variogram 
graph. In this latter article, the "mother" wavelet is not necessary associated to a multi-resolution analysis: 
the time consuming is clearly more important but a large choice of continuous wavelet transforms can be 
chosen and the choice of scales is not restricted to be a power of 2. 

The present article is devoted to an extension of the article Bardet et al. (2008). Three main improvements 
are obtained: 

1. The semiparametric Gaussian framework of Bardet et al. (2008) is extended to a semiparametric frame- 
work for linear processes. The same automatic procedure of the selection of the optimal scale is also 
applied and this leads to adaptive estimators. 

2. As in Bardet et al. (2008), the "mother" wavelet is not restricted to be associated to a discrete multi- 
resolution transform. Moreover we modified a little the definition of the sample variance of wavelet 
coefficients (variogram). The result of both these positions is a multidimensional central limit theorem 
satisfied by the logarithms of variograms with an extremely simple asymptotic covariance matrix (see ©) 
only depending on d and the Fourier transform of the wavelet function. Hence it is easy to compute an 
adaptive pseudo-generalized least square estimator (PGLSE) of d which is proved to satisfy a CLT with 
with an asymptotic variance smaller than the one of the adaptive (respectively non-adaptive) ordinary 
least square estimator of d respectively considered in Bardet et al. (2008) and Roueff and Taqqu (2009). 
Simulations confirm confirm the good performance of this PGLSE. 

3. Finally, an adaptive goodness-of-fit test can be built from this PGLSE. It consists in a normalized sum 
of the squared PGLS-distance between between the PGLS-regression line and the points. We prove that 
this test statistic converges in distribution to a chi-square distribution. Thus it is a very simple test to 
be computed since the asymptotic covariance matrix is easy to be approximated. When d > this test 
is a long memory test. Moreover, simulations show that this test provides good properties of consistency 
under and reasonable properties of robustness under Hi. 

For all these reasons, we can say that this paper is an achievement of the article Bardet et al. (2008). Moreover, 
the adaptive PGLS estimator and test represent an interesting extension of the paper Roueff and Taqqu (2009). 

We organized the paper as follows. Section[2] contains the assumptions, definitions and a first multidimensional 
central limit theorem, while Section |3] is devoted to the construction and consistency of the adaptive PGLS 
estimator and goodness-of-fit test. In Section 3] we illustrate with Monte-Carlo simulations the convergence 
of the adaptive estimator and we compare these results to those obtained with other efficient semiparametric 
estimators; moreover we study the consistency and robustness properties of the adaptive goodness-of-fit test. 
The proofs are provided in Section [5l 

2 A central limit theorem for the sample variance of wavelet coef- 
ficients 

For d < 1/2 and d' > 0, this paper deals with the following semi-parametric framework: 
Assumption A.[d,d'): X = (Xt)ti£Z *s a zero mean stationary linear process, i.e. 




t G Z, where 
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• (^s)s(EZ is o- sequence of independent identically distributed random variables such that the distribution 
of £,0 is symmetric, i.e. Pr(^o > M) ~ Pr(^o < —M) for any M e M, and El^ol"^ < oo with E^o = 0, 
Var^o = 1 and ^4 := E^q,- 

• {a(t))t^x is a sequence of real numbers such that there exist > and Cd' G K satisfying 

= ^{ca + cd'\'''{l+e{\))) for any Ae[-7r,^], (1) 
where a{\) := ^ J^kez a{k)e-^^^ , with e{\) ^0 



As a consequence, the spectral density / of X is such that 

/(A)=2^|S(A)p (2) 

and satisfies the same kind of expansion than ([T]). Thus, if d € (0, 1/2) the process X is a long-memory process, 
and if d < a short memory process (sec Doukhan et ai, 2003). 

Now define ?/> : M — > R the wavelet function. Let k E N* . Wc consider the following assumption on -0: 

Assumption -0 : K ^ M is such that 

1. the support of is included in (0, 1); 

2. [ V(i) dt = 0; 



k 



3. ■4}EC^{M.). 

Straightforward implications of these assumptions are = = for any < j < k and ^'(w) ~ C 

(u — > 0) with C a real number not depending on u. 

Define also the Fourier transform of ip, i.e. ipiu) ip{t) e~*"*(ii. Assumption Assumption \E'(A;) also implies 

that -tp has a fast decay at infinity. 

If F = (Yt)tgR is a continuous-time process, for (a, 6) G x R, the "classical" wavelet coefficient d{a,b) 
of the process Y for the scale a and the shift b is d(a, h) := ^ /jj i/;(^)lt dt. However, a process X satisfying 
Assumption A((i, d') is a discrete-time process, and from a path {Xi, . . . , X^) we define the wavelet coefficients 
of X by 

, N N 
e^{a,b) ) = ^^a(t-5)^( )?. (3) 

for (a, 6) e X Z. Then, 

Property 1. Under Assumption A{d,d') with d < 1/2 and d' > 0, and if ip satisfies Assumption '^(k) with 
k > d' , then (e(a, fc))6g{i,....Ar-a} is o, zero mean stationary linear process and 

E(e2(a,0)) = 27rcrf(i^(^,2d) a''' + —if(^,2d-d') +°("'''"''') " ^ 

with K[^ -£)-^ such that 



K(^,a)-= \i:{u)\^\u\-°'du> for all a <1. (5) 



The proof of this property, like all the other proofs, is provided in Section [5l 

Let {Xi, . . . , Xn) be a sampled path of X satisfying Assumption A{d, d'). Property allows an estimation of 
2d from a log-log regression, as soon as a consistent estimator of E(e^(a,0)) is provided. For this and with 
1 < a < A^, consider the sample variance of the wavelet coefficients. 



N-a 

T^{a):^- 5]e2(a,fc). (6) 

iV — a ^ — ' 



k=l 
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Remark 1. In Bardet et al. (2000), (2008) or in Moulines et al. (2007) or Roueff and Taqqu (2009), the 
considered sample variance of wavelet coefficients is 



VNia) 



1 



[N/a] 



[N/a] 



(7) 



fc=i 



(with a = 2^ in case of multiresolution analysis). The definition ^ has a drawback and two advantages with 
respect to this usual definition it is not adapted to the fast Mallat's algorithm and therefore more time 
consuming, but it leads to more a simple expression of the asymptotic variance and simulations exhibit that 
this asymptotic variance is smaller that the one obtained with 

The following proposition specifies a central limit theorem satisfied by logr/v(a), which provides the first 
step for obtaining the asymptotic properties of the estimator by log- log regression. More generally, the following 
multidimensional central limit theorem for a vector (log r/v(ai))j can be established, 

Proposition 1. Under Assumption A[d,d'), d < 1/2 and d' > 0, and if tp satisfies Assumption ^'(fc) with 
k > 2. Define £ G N \ {0,1} and {ri,--- ,rg) £ (N*)^. Let (a„)„gN be such that N/a^ — > oo and 

OAT Af-i/(i+2''') — > OO. Then, 

Af-s-oo 



(^logrjv(rja7v) - 2dlog{r^aN) - log 



•■2n' 



l<i<i W-s-oo 



A/',(0; r(ri,--- ,r,,^,d)), 



with r(ri, • • • ,r£,ip,d) = {'Jij)i<ij<i the covariance matrix such that 



An 



inr') 



I \l-2d 

j 



(■0,2ti) 



|^(r,A)riV'(r,A)| 



-dX. 



(8) 



(9) 



3 An adaptive estimator of the memory parameter and an adaptive 
goodness-of-fit test 

The CLT of Proposition [T] is very interesting because it has several consequences. We will see that the 
(quite) simple expression of the asymptotic covariance matrix is an important advantage compared to the 
complicated expression of the asymptotic covariance obtained in the case of a multiresolution analysis (see 
Roueff and Taqqu, 2009a). First it allows to obtain an estimator dN of d by using an ordinary least square 
estimation. Hence, define 



rfjv(ajv) (0 (^;„ Z,„)-iz;„(logTjv(r,ajv)), 



<i<e 



with Z„ 



I 1 log(aAr) ^ 
1 log(2aAr) 

V 1 \og{(iaN) ) 



(10) 



Remark 2. From Proposition^^ it is not possible to chose (ri, . . . , r^) for minimizing the asymptotic covariance 
matrix T(ri, ■ ■ ■ ^ri^ip^d) without knowing the value of d. Hence, in the sequel we will only consider the choice 
(ri,r2,--- ,rf) = (l, 2,. ..,£). 

Then, it is clear from Proposition [T] that dN{aN) converges to d following a central limit theorem with conver- 
gence rate i / — when un satisfies the condition ajv A^^i/(i+2'* ) — y oo. 



. " " TV- , ™ 

However, in practice, d' is unknown. In Bardet et al. (2008), an automatic procedure for choosing an "opti- 
mal" scale On has been proposed. We are going to apply again this procedure after recalling its principle: for 
a e (0, 1), define 

QN{a,c,d) = (YN{a) -Zn^{ ^^))' ■ (rw(«) - ( 3^ ))' "^^^^ = i^^sTNiiN")) 
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QN{ct,c,d) corresponds to a squared distance between the £ points (log(iA^") , logT/v(iA^")) . and a line. It 
can be minimized first by defining for a G (0,1) 

and tlien define Sn by: 

n ^ ■ n t ^ ^. a / ^ 3 \og[N/e] ^ 

QAf(Q!Ar) = mm gAr(a) where = <^ , — ^. 

aeAN LlogA' log A* logA* J 

Remark 3. As it was also claimed in Bardet et al. (2008), in the definition of the set An, logN can be 
replaced by any sequence negligible with respect to any power law of N . Hence, in numerical applications we 
will use 10 logA^ which significantly increases the precision ofotN- 

Under the assumptions of Proposition [1] one obtains (see the proof in Bardet et al., 2008), 

logflAT -p, * 1 



UN 



log N N^oo 1 + 2d' 



Then define: 



dAT := (i(7V"") and Tjv := r(l, ^at, V-)- (H) 



It is clear that — > d (a convergence rate can also be found in Bardet et ai, 2008) and therefore, from 

N^oo 

the expression of F in © and its smoothness with respect to the variable d. Fat — > F(l, • • • ,i,d,ip)- Thus 
it is possible to define a (pseudo)-generalized least square estimator (PGLSE) of d. Before this, define 

6Sjv log log 



(£-2)(l-SAr) logA^ 



For technical reasons {i.e. Pr(a7v < «*) — > 0), which is not satisfied by S^v, see Bardet et at, 2008), in the 

sequel we prefer to consider rather than Sat. Finally, using the usual expression of PGLSE, the adaptive 
estimators of c and d can be defined as follows: 

( ) (^Ar°«f^^Zjv5jv) Z'^^^fjf^YN{aN). (12) 

The following theorem provides the asymptotic behavior of the estimator d^, 

Theorem 1. Under assumptions of Proposition\^ with (t'^{£) := (O -) (Zj (F(l, • • • ,l,d,^)) ^ Zi) ^(O -)', 



^IdN-d) Af{0;al{£)) and Vp > l2^.\d^^d\ ^ 0. (13) 

Remark 4. 1. From Gauss-Markov Theorem it is clear that the asymptotic variance of d^ is smaller or 

equal to the one of dn ■ Moreover d^ satisfies the CLT hlS^) which provides confidence intervals that are 
simple to calculate. 

2. In the Gaussian case, the adaptive estimator djq converge to d with a rate of convergence rate equal 

d' 

to the minimax rate of convergence A^i+2<j' up to a logarithm factor (see Giraitis et al., 1997). Thus, 
this estimator can be compared to adaptive log-periodogram or local Whittle estimators (see respectively 
Moulines and Soulier, 2003, and Robinson, 1995). 

3. Under additive assumptions on ip (ip is supposed to have its first m vanishing moments), the estimator 
dN can also be applied to a process X with an additive polynomial trend of degree < m — 1 . Then the tend 
is "vanished" by the wavelet function and the value of djsi is the same than without this additive trend. 
Such robustness property is not possible with an adaptive log-periodogram or local Whittle estimator. 
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Finally it is easy to deduce from the previous pseudo-generalized least square regression an adaptive goodncss- 
of-fit test. It consists on a sum of the PGLS squared distances between the PGLS regression line and the 
points. More precisely consider the statistic: 

TN^j^{Y^{aN)-Z^.4 )pN'{yN{SN)-Z^.^{ )). (14) 

Then, using the previous results, one obtains: 
Theorem 2. Under assumptions of Proposition^^ 

T^r x'(£-2). (15) 

This (adaptive) goodncss-of-fit test is therefore very simple to be computed and used. In the case where > 0, 
which can be tested easily from Theorem [TJ this test can also be seen as a test of long memory for linear 
processes, ion 

4 Simulations 

In the sequel, the numerical consistency and robustness oidM are first investigated. Simulation are realized and 
the results obtained with the estimator are compared to those obtained with the best known semiparametric 
long-memory estimators. Finally numerical properties of the test statistic T^r are also studied. 

Remark 5. Note that all the softwares (in Matlab language) used in this section are available with a free 
access on http : / / samm.univ-parisl . f r/-Jean-Marc-Bardet . 

To begin with, the simulation conditions have to be specified. The results are obtained from 100 generated 
independent samples of each process belonging to the following "benchmark". The concrete procedures of 
generation of these processes are obtained from the circulant matrix method in case of Gaussian processes or 
a truncation of an infinite sum in case of non-Gaussian process (see Doukhan et al., 2003). The simulations 
are realized for d = 0, 0.1, 0.2, 0.3 and 0.4, for N = lO'^ and 10* and the following processes which satisfy 
Assumption A{d,d'): 

1. the fractional Gaussian noise (fGn) of parameter H = d + 1/2 (for < d < 0.5) and cr^ = 1. A fGn is 
such that Assumption A{d, 2) holds even if a fGn is generally not studied as a Gaussian linear process; 

2. a FARlMA[p,d,q] process with parameter d such that d € [0,0.5), p,q S N. A FARlMA[p, d, q] process 
is such that Assumption A{d, 2) holds when E^q < oo where is the innovation process. 

3. the Gaussian stationary process X^''-''^ \ such that its spectral density is 

/3W = ^(1 + ^'') for AG [-^,0)U(0,^], (16) 

with d £ [0,0.5) and d' G (0,oo). Therefore the spectral density f^ is such that Assumption A{d,d') 
holds and since X'^'^''^ ^ is a Gaussian process, from the Wold decomposition it is also a linear process. 

A "benchmark" which will be considered in the sequel consists of the following particular cases of these 
processes for d = 0, 0.1, 0.2, 0.3, 0.4: 

• Xi : fGn processes with parameters H ^ d + 1/2; 

• X2 : FARIMA[0, d, 0] processes with standard Gaussian innovations; 

• X3 : FARIMA[0, d, 0] processes with innovations following a uniform U[—l, 1] distribution; 

• X4 : FARIMA[0, d, 0] processes with innovations following a symmetric Burr distribution of parameter 
(2, 1) {i.e. its cumulative distribution function is F{x) = (1 — i (1 -|- x'^)~^) Ix>o + 5(1 + 2;^) IIx<o); 

• X5 : FARIMA[0, d, 0] processes with innovations following a Cauchy distribution; 

• Xq : FARIMA[1, d, 1] processes with standard Gaussian innovations, MA coefficient (p ~ —0.3 and AR 
coefficient = 0.7; 
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• X-! : FARIMA[1, d, 1] processes with innovations following a uniform 1] distribution, MA coefficient 
~ —0.3 and AR coefficient (p ~ 0.7; 

• ATg : X'^'^''^ ) Gaussian processes with d! = 1. 

Note that the processes X4 and X5 do not satisfy the condition E^q required in Theorems [T] and [2] However, 
since we consider the logarithm of wavelet coefficient sample variance and not only the wavelet coefficient 
sample variance, it should be possible to prove the consistency of djv under a condition such as E^q with r > 2 
and perhaps only r > 0... 



4.1 Comparison of the wavelet based estimator and other estimators 

First let us specify the different choices concerning the wavelet based estimator: 

Choice of the function ip: as it was said previously, it is not mandatory to use a wavelet function associated 
with a multi-resolution analysis. We use here the function ip{x) = x'^{l — x)^(^x^ — | + ^ x — Y]-)l2:e[o,i] 
which satisfies Assumption 5* (2) 

Choice of the parameter £: This parameter is important to estimate the "beginning" of the linear part 
of the graph drawn by points {log{iaN),^ogTN{iaN))i<i<e and therefore the data-driven un- Moreover this 
parameter is used for the computation of as the number of regression points. We chose a two step procedure: 

1. following a numerical study (not detailed here), i = [2 * log{N)] (therefore i — 13 tor N ~ 1000 and 
£ = 18 for N = 10000) seems to be a good choice for the first step: compute Sn- 

2. for the computation of d^, we first remark that with the chosen function tJj, Tn does not seem to 
depend on d. As a consequence we decide to compute a-^{£) = (O ^) (Z( (r(l, • • • ,£,d, -0)) Zi) (O 

for several values of d and i using classical approximations of the integrals defined in r(l, • • • , ^, d, V')- 
The results of these numerical experiments are reported in Figure O The conclusion of this numerical 
experiment is the following: for any d G [0, 0.5[, o'^(^) is almost not depending on d and decreases when £ 
increases. Therefore we chose for this second step £ = 7Vi~"« (log N)~^: by this way the larger considered 
scale is A^(log7V)~^ (which is negligible with respect to N and therefore the CLT[8]holds). 



0.22 r 
0.21 - 



0.2 - 
0.19 - 
_ 0.18 - 



-N=10 
-N=20 
-N=100 
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- N=500 



0.17- 
0.16 - 
0.15 - 



0.25 
d 



Figure 1: Graph of the approximated values of (j'^{£) defined in[T^for d £ [0, 0.5] and N = 10, 20, 50, 100, 200 
and 500. 

Now we consider the previous "benchmark" of processes and apply the estimator dN and 2 other semiparametric 
d-estimators known for their accuracies: 

• dMS is the adaptive global log-periodogram estimator introduced by Mouhnes and Souher (1998, 2003), 
also called FEXP estimator, with bias- variance balance parameter k ~ 2; 

• da is the local Whittle estimator introduced by Robinson (1995). The trimming parameter is m = N/30. 
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Simulation results arc reported in Table [TJ 

Conclusions from Table{^' The wavelet based estimator numerically shows a convincing convergence rate 
with respect to the other estimators. Both the "spectral" estimator dji and dms provide more stable results 
almost not sensible to d and the flatness of the spectral density of the process, while the convergence rate of the 
wavelet based estimator djv is more dependent on the spectral density of the process. But, especially in cases 
of "smooth" spectral densities (fGn and FARIMA(0, d, 0)), d^ is a very accurate scmiparametric estimator 
and is globally more efficient than the other estimators. 

Remark 6. In Bardet et al. (2008) we also compared two adaptive wavelet based estimators (the one defined 
in Veitch et at, 2003 and the one defined in Bardet et al., 2008) with dMS and dji (and also with two 
others defined in Giraitis et al., 2000, andjjiraitis et al., 2006, which exhibit worse numerical properties of 
consistency). We observe that \/ MSE of d^ obtained in Table[l\is generally smaller than the one obtained 
with the estimator defined in Bardet et al. (2008) for two reasons: the choice of the definition (0j of wavelet 
coefficient sample variance instead of ^ and the choice of a PGLS regression instead of a LS regression. 

Comparison of the robustness of the different semiparametric estimators: To conclude with the 
numerical properties of the estimators, 3 different processes not satisfying Assumption A{d, d') are considered: 

• a Gaussian stationary process with a spectral density /(A) = [l-^l ^ 7i'/2| "^'^ for all A G [— 7r,7r] \ 
{-7r/2,7r/2}. The local behavior of / in is /(|A|) - |A|"2d ^j^^ d = 0, but the smooth- 
ness condition for / in Assumption A(0, 2) is not satisfied. 

• a trended Gaussian FARIMA(0, d, 0) with an additive linear trend {Xt = FARIMAt + (1 - 2t/n) for 
i = 1, • • • , n and therefore mean{Xi, • • • , X„) ~ 0); 

• a Gaussian FARIMA(0, d, 0) with an additive linear trend and an additive sinusoidal seasonal com- 
ponent of period T = 12 {Xt = FARIMAt + (1 - 2t/?i) + sin(7ri/6) for t = 1, • • • ,n and therefore 
mean{Xi, • • • , X„) ~ 0). 

The results of these simulations are given in Table [2] 



Conclusions from Table [H' The main advantages of d^ with respect to di/5 and da are exhibited in this 
table: it is robust with respect to smooth trends (or seasonality). Note that the sample mean of dMs and d^ 
in the case of processes with trend or with trend and seasonality is almost 0.5 

4.2 Consistency and robustness of the adaptive goodness-of-fit test: 

Tables [1] and provide informations concerning the adaptive goodness-of-fit test. A general conclusion is that 
the consistency properties of this test are clearly satisfying when N is large enough (N = 1000 seems to be 
too small for using this goodness-of-fit test). 

We also would like to know the behavior of the test statistic under the assumption Hi. We are going to 
study the case of a process which does not satisfy either the stationarity condition cither the relation ([T]) also 
verified by the spectral density. Hence 3 particular cases arc considered: 

1. a process X denoted MFARIMA and defined as a succession of two independent Gaussian FARIMA pro- 
cesses. More precisely, we consider Xt = FARIMA{0, 0.1, 0) for i = 1, • • • ,n/2 and Xt = FARIMA{0, 0.4, 0) 
for t = n/2 -f 1, • • • ,n. 

2. a process X denoted MGN and defined by the increments of a multifractional Brownian motion (intro- 
duced in Peltier and Levy-Vehel, 1995). Using the harmonizable representation, define Y = (Yt)t such 
that 

where W{dx) is a complex-valued Gaussian noise with variance dx and -ff(-) is a function (the case 
H() = H with H e (0,1) is the case of fBm), C(-) i a function. Here we consider the functions 
alt) = 0.5 4- 0.4sin(i/10) and C{t) = 1. Then Xt = Yt+i - Yt for t e Z. X is not a stationary process 
but "locally" behaves as a fGn with a parameter H{t) (therefore depending on t). 
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099 






107 


105 


112 


110 


097 




V AJ SE (ijv 


0.048 


0.050 


0.053 


0.061 


0.074 






0.82 


0.82 


0.75 


0.73 


0.67 


^3 




u.uyz 


n nQ/i 


U.UoU 


n riQQ 
u.uyy 


u.uyo 




^/ AT Q TP 


n 1 1 

U . 1 1 o 




n 1 nn 


112 


u.uyo 




^ / A T Q T? ^ 


0.052 


0.071 


0.063 


0.0T7 


0.092 




Pn 


0.84 


0.72 


0.75 


0.67 


0.51 


y 


y l\i O H, (IMS 


U.Uoo 


u . u f y 


n n7Q 
u.u / y 


u.uyo 


n 1 ri/i 

U. lU'i 




V 1^ J-J LtJY 


096 


100 


103 


097 


095 




^ / AT Q TP W,, 


051 


066 


056 


061 


064 




Pn 


0.84 


0.78 


0.78 


0.75 


0.66 


^5 




0.069 


0.067 


0.077 


121 


143 




^ / A T Q TP rl ^. 


n (179 


n (ITS 


u.uyo 


08T 


074 




^ / A T Q TP ^ 


0.073 


0.069 


0.083 


0.08T 


0. 120 




Pn 


0.73 


0.69 


0.68 


0.74 


0.64 




\/ AT ^TP rl ^ T ^ 


0.096 


0.091 


0.090 


0.086 


0.093 




V J U O J-/ (1 f{ 


111 


102 


100 


101 


101 




. / A T Q TP ^ 


0.153 


0. 146 


0. 144 


0.158 


U. 14 / 






0.52 


0.47 


0.48 


0.39 


0.50 




VMSE dMS 


0.085 


0.096 


0.086 


0.093 


0.098 




VMSE dR 


0.106 


0.116 


0.097 


0.099 


0.092 




VMSE dN 


0.155 


0.150 


0.56 


0.147 


0.157 




Pn 


0.60 


0.55 


0.49 


0.52 


0.41 


Xs 


VMSE dMS 


0.097 


0.104 


0.097 


0.094 


0.101 




VMSE drt 


0.120 


0.116 


0.117 


0.113 


0.110 




VMSE dN 


0.179 


0.189 


0.177 


0.175 


0.176 




Pn 


0.75 


0.75 


0.68 


0.66 


0.67 



N = 10* 



Model 


^MSE 


d = 


d = 0.1 


d = 0.2 


d = 0.3 


d = 0.4 


Xi 


VMSE dMS 


0.032 


0.029 


0.031 


0.031 


0.036 




VMSE Sr 


0.028 


0.028 


0.029 


0.029 


0.032 




VMSE dN 


0.016 


0.027 


0.034 


0.025 


0.022 






0.97 


0.93 


0.97 


0.94 


0.97 


X2 


VMSE dMS 


0.034 


0.030 


0.029 


0.032 


0.028 




VMSE drt 


0.027 


0.027 


0.029 


0.028 


0.023 




VMSE dN 


0.026 


0.019 


0.019 


0.019 


0.025 




Ptl 


0.95 


0.97 


0.98 


0.96 


0.94 


X3 


VMSE dMS 


0.034 


0.034 


0.033 


0.030 


0.031 




VMSE Sr 


0.029 


0.028 


0.028 


0.028 


0.029 




VMSE dN 


0.027 


0.017 


0.016 


0.022 


0.030 




Pi. 


0.93 


0.96 


0.97 


0.93 


0.92 


Xi 


VMSE dMS 


0.029 


0.060 


0.036 


0.031 


0.031 




VMSE Sr 


0.025 


0.027 


0.029 


0.031 


0.029 




VMSE dN 


0.016 


0.020 


0.021 


0.015 


0.023 






0.95 


0.91 


0.97 


0.92 


0.91 


Xs, 


VMSE dMS 


0.093 


0.046 


0.039 


0.073 


0.047 




VMSE Sr 


0.040 


0.046 


0.035 


0.032 


0.024 




VMSE dN 


0.056 


0.071 


0.027 


0.025 


0.024 




P-n 


0.85 


0.88 


0.93 


0.86 


0.85 


Xe 


VMSE dMS 


0.031 


0.032 


0.033 


0.032 


0.029 




VMSE Sr 


0.029 


0.028 


0.028 


0.028 


0.028 




VMSE dN 


0.045 


0.044 


0.046 


0.044 


0.041 




P-n 


0.96 


0.93 


0.89 


0.93 


0.90 


Xr 


VMSE dMS 


0.030 


0.031 


0.037 


0.030 


0.029 




VMSE Sr 


0.027 


0.027 


0.032 


0.028 


0.027 




VMSE dN 


0.049 


0.044 


0.050 


0.048 


0.046 




Ptl 


0.94 


0.91 


0.88 


0.87 


0.86 


Xs 


VMSE dMS 


0.038 


0.040 


0.040 


0.035 


0.037 




VMSE Sr 


0.039 


0.038 


0.040 


0.036 


0.035 




VMSE dN 


0.085 


0.083 


0.086 


0.087 


0.085 




Pn 


0.92 


0.94 


0.94 


0.95 


0.93 



Table 1: Comparison of the different long-memory parameter estimators for processes of the benchmark. For 
each process and value of d and N , \J MSE are computed from 100 independent generated samples. Here 
Pn = ^ "^{Tn < ^x^l^-silO-^S)) .■ this is the frequency of acceptation of the adaptive goodness- of-fit test. 
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Model I \/MSE I d = \ d = 0.1 \ d=Q.2 \ d = 0.3 \ d = OA \ 



N = 10'^ 







u.uoy 


n nQi 


U. IZo 


U . 1 o z 


n 1 fifi 

U. IDD 




V Al SE dpi 


0.112 


0.111 


0. 119 


0.106 


0.106 




\/ AI S E dj\j 


0.041 


0.0T6 


0.114 


0. 142 


0. 180 






0.82 


0.78 


0.63 


0.59 


0.46 


Trend 


VMSE dMS 


0.548 


0.411 


0.292 


0.190 


0.142 




VAISE Sr 


0.499 


0.394 


0.279 


0.167 


0.091 




VMSE <i„ 


0.044 


0.052 


0.056 


0.060 


0.065 




Pn 


0.83 


0.81 


0.80 


0.73 


0.64 


Ti'end + Seasonality 


VAISE dMS 


0.479 


0.347 


0.233 


0.142 


0.112 




y'AISE Sr 


0.499 


0.393 


0.279 


0.167 


0.091 




x/A/SB dn 


0.216 


0.215 


0.215 


0.217 


0.185 




Pn 


0.35 


0.26 


0.18 


0.21 


0.18 



Model I VAISE \ d = \ d = 0.1 \ d = 0.2 \ d = 0.3 \ d = O.i \ 



N = 10* 



GARMA(0, d, 0) 


VMSE dMS 


0.031 


0.035 


0.039 


0.049 


0.062 




Vaise Sr 


0.028 


0.031 


0.030 


0.030 


0.034 




VAISE div 


0.023 


0.053 


0.052 


0.058 


0.060 




Pre 


0.96 


0.94 


0.93 


0.91 


0.88 


Trend 


VAISE dMS 


0.452 


0.286 


0.167 


0.096 


0.056 




VAISE Sr 


0.433 


0.308 


0.191 


0.100 


0.051 




Vaise d^ 


0.014 


0.016 


0.016 


0.021 


0.028 




Pre 


0.99 


0.97 


0.97 


0.95 


0.93 


Ti'end + Seasonality 


VAISE dMS 


0.471 


0.307 


0.196 


0.123 


0.076 




VAISE Sr 


0.432 


0.305 


0.191 


0.100 


0.052 




Vaise 


0.044 


0.069 


0.047 


0.042 


0.045 




Pre 


0.83 


0.81 


0.76 


0.78 


0.82 



Table 2: Robustness of the different long-memory parameter estimators. For each process and value of d and 
N, V M SE are computed from 100 independent generated samples. Here pn = #(3Ar < (7^2(£_2) (0.95)) .• this 
is the frequency of acceptation of the adaptive goodness-of-fit test. 



Model 


N = 10^ 


N = 10* 


MFARIMA 
MGN 
MFGN 


Pre = 0.58 

Pre = 0.18 
Pre = 0.02 


Pre = 0.87 

Pre = 0.08 

p„ = 0.04 



Table 3: Robustness of the adaptive goodness-of-fit test with Pn — < Qx^(^-2)(0-95)) the frequency of 

acceptation of the adaptive goodness-of-fit test. 

3. a process X denoted MFGN and defined by the increments of a multiscale fractional Brownian motion 
(introduced in Bardet and Bertrand, 2007). Let Z = {Zt)t be such that 

where W{dx) is a complex-valued Gaussian noise with variance dx, H{-) and a{-) are piecewise constant 
functions. Here we consider the functions H{x) = 0.9 for 0.001 < a; < 0.04 and H = 0.1 for 0.04 < a; < 3. 
Then Xt = Zt+i — Zt for t £ Z and X is a Gaussian stationary process which can be written as a linear 
process behaving as a fGn of parameter 0.9 for low frequencies (large time) and as a fGn of parameter 
0.1 for high frequencies (small time). 

We applied the test statistic to 100 independent replications of both these processes. The results of this simu- 
lation are proposed in Table [3l We observed that the processes MGN and MFGN are clearly rejected with the 
adaptive goodness-of-fit test. However, the test is not able to reject the process MFARIMA which does not 
satisfy the Assumption of the Theorem [2] The reason is that the test does an average of the behavior of the 
sample and in the case of changes (it is such the case for MFARIMA) it is the average LRD parameter which 
is estimated (an average of 0.30 for djv and a standard deviation 0.03 arc obtained). 
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5 Proofs 



First, we will use many times the following lemma: 

Lemma 1. If ip satisfies Assumption ^(fc) with k >1, then there exists > such that for all A € M, 



k=l •'^ 



i>{t)e-'^*dt 



< a 



(17) 



Proof of Lemma{Ji This proof is easily established from a mathematical induction on k when A € [— 7r,7r]. 
Then since we consider 27r-periodic functions (of A) the result can be extended to R. □ 

Proof of Property [1[ First, it is clear that for a G N*, (e(a, 6))i<f,<Ar_a is a centered linear process. It is a 

stationary process because X is a stationary process and clearly J2k=i a'^f^^i^^) ^ 

Now following similar computations to those performed in Bardet et al. (2008), we obtain for a G N*, 



E(e2(a, 



du. 



Now, since sup^^gg |'0(u)| < oo, for a large enough. 



E{e\a,Q))- f {-) x \i^{u)\^ du < 2 sup |^(u)|C^ — / {1 + \u\'') f {-)\^j{u)\Uu 



Under Assumption 4'(fc) for any fc > 1, sup„gR(l + u")\ip{u)\ < oo for all n £ N. Therefore, since there exists 
Ca > satisfying /(A) < CaA"^'' for all A G [-7r,7r], for ah d < 1/2, 



{l + \u\'')f{-)\^{u)\'du<(ca / {l + \u\'^)u-^''mu)\^du) 

and thus there exists C > (not depending on a) such that for a large enough. 



.2d 



(e2(a,0))- / /(-) X \i^{u)\^du 



< Ca 



2d-k 



(18) 



Following the same reasoning, for any n > 0, there exists C{n) > (not depending on a) such that for a large 
enough. 



/(^) \ip{u)\^du- I /(^) \$iu)\'du < C{n)a 



(19) 



Finally, from Assumption A(o?, d'), we obtain the following expansion: 



f{^)mu)\'du = 2n {c,{^)-'' + c A- f + f''M-))mu)\'du 
, a' J_ryo a' a' a' a ' 



(20) 



using the definition ([5]) of A'(^, q,) and because lim^-i-o £(A) — and applying Lebcsgue Theorem. Then, using 
(US]), (HH) and (HOI), we obtain that 



E(e^(a,0)) - 2t: CdK(^^2d) a + 27rcd' K{^^2d-d') 



g^d-d' 



When fc > d', it implies (g]). 



(21) 



□ 



Proof of Theorem]^ We decompose this proof in 4 steps. First define the normalized wavelet coefficients of 
A by: 



eAr(a, b) := a gW and 6 G Z, 

VEe2(a,l) 



(22) 
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and the normalized sample variance of wavelet coefficients: 

N-a 

TN{a):=j^^Y.^{a,k). (23) 
fe=i 

Step 1 We prove in this part that A^Cov [Tjsiir ajv), T^ir' un)) converges to the asymptotic covariance matrix 
r(£i, • • • , r^, V', d) defined in ([9]). First for A G M, denote 

Sa{X) :=i V^(-)e'^*/r 

Then for a e N* and b = 1, • • • ,N — a, since ip is (0, l])-supported function, 

N , a „^ 

t=i ^ t=o ^ •'-'^ 

5„(A)S(-)e*(''-^)^dA. (24) 
Thus, 

Coy{f^ia),fr,ia')) = ^ ^ J] Gov (S2(a, &), ^^(a', 6')) 

h=\ 6' = 1 

But, 

Gov (e^,,,) , e^,, ,,)) - ^ E E ( fl "(*^ " ^^^^^^O ( H " ^')^(^)) (^^^^^^ ' ^^3^^^) 

= C1+C2, (26) 



since there are only two nonvanishing cases: s\ = S2 = S3 = S4 (Case 1 => Ci), si = S3 7^ S2 = S4 and 
si = S4 7^ S2 = S3 (Case 2 => C2). 

* Case 1; in such a case, Cov (■Csi'Csa j Csa^sa) = tM — ^ and 

a a ^-^ I a I I a 

sgZ t=l t=l 

Ci = (Ai4- l)aa' lim / (iAdA'd/id/^'e*[''(^-^')+'''(^-^')l 

X ^ e'^[(^-^')+('^-'''%a(aA)a(A)5,(aA')S(A')^a'(aV)S(/.)5a'(a'/i')S(/i') 

s=-A/ 

using the relation ([24]) . From usual asymptotic behavior of Dirichlct kernel, for 17 e C27r((^7r, tt)), lim / DM{z)g(x~ 

^ ^ 1 sin ((2Af + 1)0/2) 1 ^ 

z)(iz = g(x) uniformly in x with Dm{z) ~ 7 — —\ = — > e . Therefore with h&C func- 

27r sin (z/2) 27r _ ^ 

tion 27r-periodic for each component, 

lim / 2iT D m{.{.X - X') + {^- ^i'))h{X,X' ,^i,^i')dXdX'd^idii' = 2tt / h{X' - + ^i' ,X' , ^i, ^i')dX' d^id^i' \ 



M-i-oo 
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Therefore, 

Ci ^ 27r (^4 - 1) a a' I dX'dfidfi'e'^^'~^''^^'''-'''> 

J [—TT,7t]'^ 



X Sa{a{\' - n + n'))5{X' - n + ti')Sa{ay)5{X')Sa'{a'fi)a{fi)Sa'{a'fi')5{fi'). (27) 
* Case 2: in such a case, with si ^ S2, Gov (Csi^sai CsiCsa) — 1 and 

9 ^ i -h i -h' ^ i -^h ^ i 

^2 = ^ E E - ^)^(^) E "(^2 - s)v^(^) E - s')v^(^) E "(^4 - s'm-^ 

a a' ^-^ ^-^ a ^-^ a ^-^ a ^-^ 

(S,S')6Z2,S^S' tl = l t2 = l t3 = l t4. = l 



C-2 — 



T + i E E "(^1 - ^)^( V) E - V^) E "(^^ - V) E 

(s,s')eZ2 tl = l t2 = l t3=l t4 = l 

— h 2 a a' hm hm / dXdX' dadu! 

1 M^oo M'^oo 7r ^.^,4 



A/ M' 

^ J2 e*^(^'-^)+'^'(^'-^'5,(aA)a(A)5a,(a'A')a(A')5a(aM)«(M)^a'(aV)S(/.') 

= L^ + gTT^aa' / e'^^-''^'^''-'''^Sa{aX)Sa'ia'X)Saia^l)Sa'ia'^l) x |S(A)P l^dAd^, 

using the asymptotic behaviors of two Dirichlet kernels. 

N-a N-a' 

Now we have to compute E] E] '•^i + In both cases (Ci and C2), one again obtains a function of a 

6=1 6' = l 

Dirichlet kernel: 

Fj,{a,a',v) y ^^M^^.')/2Bin((iV~aW2) sin((iV - a0z;/2) ^ ^^8) 
tl ^1 sin (t;/2) 

For a continuous function /i : [— tt, tt] i— > R, 

1 /"'^"'^ 

^^^^^ / hi ii\T^^T ( n n' lAdn — 1im / 

N^oc N 

thanks to Lebcsguc Theorem and with a/N ^ {N — > 0). Then, from p7| 

N-aN- 



l r 1 /""^ V V f°° sin^fw/2) 
lim - / hiv)FN{a,a',v)dv ^ lim — h{-)FN{a, a' , -)dv ^ Ah{0) \^dv ^ 27rh{0), 



N. ' ' 



N -aN -a 



tE E Ci^47r2(^4-l)aa' / dX'd^l'\Sa{aX')\^ \Sa'{a' \aiX')\^W)\' 

b=l b' = l J[-7r.7r]2 



47r^(A*4-l)/ |5,(A)|^|S(A/a)|^dA / \Sain)\' \a{^l/a')\'d^^ 



a TT 



a'TT 



0—1 b' — l 

using the same arguments than in Property [1] since cat — > 00 (and therefore a — > 00 and a' — > 00). 
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Moreover, if we consider that a = rajsf and a' — r'a^-, 

N-aN-a' „7r -, -, N-aN-a' 



6=1 b' = l '^^'^ ^ 6=1 t' = l 

/aivir ^ 2 4 2iV 1 1 W-raw W-r'aiv 

|'S'ra„(?'A)| |S',.'Q„(r'A)| |a(A/ajv)| dX -— — Ci 
^a^T^ - 1 iV - raw iV - r'aw ^ 

47r2(iV-ra^)(7V-r'a^) ^ '^n^^ ^ ^7^.2.) i-oo 

always using the same trick than in Property [1] since a — > cxj and a' 00. Therefore, with pop . one deduces 
that: 



N—riaN 



■ Gov (Tat (r Cat), Tjv(r'ajv)) — 47r— 5 / — ^ dX. (31) 

OAT A'->oo ^U,2d) J-oo A4'' 

Note that if r = r' then Var (T]s[(r aN)) — > (T^,{d) ~ 647r^ — {if>*if>,4:d) depending on ip and d. 

raN JV^oo ^U,2d) 

Step 2 We prove here that if the distribution of the innovations {£,t)t is such that there exists ?- > sat- 
isfying E(e'"^") < 00 (condition so-caUed the Cramer condition), then for any a E N* , (TAr(ri ajv))i<i<£ = 
'^k=i''^" e^(riaAr,fc)^ satisfies a central limit theorem. Such theorem is implied by the proof 

that X]i=i N-r aN X^i^i'"'"" ^{'I'iO-N, k) asymptotically follows a Gaussian distribution for any (wi)i<i<f € 

M^. For establishing this result we are going to adapt a proof of Giraitis (1985) which shows central limit the- 
orems for function of linear processes using a decomposition with AppcU polynomials. Indeed we specified 
that X is a two-sided linear process and therefore martingale type results as in Wu (2002) or Furmanczyk 
(2007) can not be applied. Moreover, since ojv is a sequence depending on TV it is required to prove a central 
limit theorem for triangular arrays. Unfortunately the recent paper of Roueff and Taqqu (2009) dealing with 
central limit theorems for arrays of decimated linear processes, and which can be applied to establish a mul- 
tidimensional central limit for the variogram of wavelet coefBcients associated to a multi-resolution analysis 
can not be applied here because in this paper this variogram is defined as in ([7]) with coefficients taken every 
n/uj (~ aisi with our notation) and the mean of nj (N/qn with our notation) coefficients is considered (and 
the convergence rate is ^ynj) ■ Our definition of the wavelet coefficient variogram ^ is an average of iV — 
terms and the convergence rate is N/gn- Then we chose to adapt the results and method of Giraitis (1985). 
More precisely, consider the case i = 1. For a > 0, (e(a, b))i<cb<N-a is a stationary linear process satisfying as- 
sumptions of the paper of Giraitis (called Xt in this article) . Now we consider H2 (x) = — 1 the second-order 
Hermite polynomial and would like to prove that 

AT 1/2 1 N—ajv ^ —1/2 

6=1 6=1 

Now since the distribution of is supposed to satisfy the Gramer condition, following the proof of Proposition 
6 (Giraitis, 1985), define S"^^ = A^n"\eiaN,b)) where A^^"^ is the Appell polynomial of degree n 

corresponding to the probability distribution of e{aN, •). We are going to prove that the cumulants of order 
fc > 3 are such as 

x(4"^^»,...,5(;('=»)=o((TVaA.)'=/2) (32) 
for any n(l), • • • , n{k) > 2 (the computation of the cumulant of order 2 is induced by Step 1 of this proof). 
Indeed, x(5^^^'''*, . . . , S'^'"'^'*'') — J2-yerQ(T)'^'y-^'r(-^) where To{T) is the set of possible diagrams and the 
definition of If{N) is provided in (34) of Giraitis (1985). 

In the case of Gaussian diagrams, Ij{N) = o((iVaAr)'^/^) , since this case is induced by the Gaussian case and 
the second order moments. 

If 7 is a non Gaussian diagram, mutatis mutandis, we are going to follow the notation and proof of Lemma 2 
of Giraitis (1985). Note first from Step 1, we can write: 

e(a,&) = ^/3a(6-s)Cs with /3,(s) = f 5,(aA)a(A)e^^^dA. (33) 
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Then for u G [— 7r,7r], 

^ oo 

y S'a(aA)a(A)e'^(^-")dA 

-rr 



27r 

S — — OO 



/a 



VEe2(a,6) 



5a(au)a(u), 



with the asymptotic behavior of Dirichlet kernel. Now, in case a/ of Lemma 2 of Giraitis (1985), consider 
the diagram Vi = {(1, 1), (2, 1), (3, 1)} and assume that for the rows Lj of the array T, j = 1, • • • , fc (fc > 3), 
I Vi n ijl > 1 for at least 3 different rows Lj. Then the inequality (39) can be repeated, and on the hyperplane 
part of the integral (34) provides 



3 



/ dxiidx2idx3i Y\_DN{{xji ^ ^ Xjn(j))f3a{xji) < C ai{ui)a2{u2)a3{u3), 

with Ui = Xi2 + • • • + a;in(i) 8'iid the same expressions of provided in Giraitis (1985). It remains to bound 
ai{u). But, with the same approximations as in the proof of Property [U for a and N large enough 

\f5a{u) Dn{x + u)\dx ^ \/2Tr —r= / |- — | |-Div( — hu)|du 

< 2V^snpO-^i^} r \D Nix + u)\dx 

< 2Csup{M^}VilogiV, 

since there exists C > such as \Dn{x + u)\dx < ClogN for any u G [— 7r,7r]. Now for i = 2,3, a and N 
large enough, 

= Il?a„(-)^iv("+-)ll2 

< 2Csnp{^-^i^}aN r \Dlix + u)\dx 
Then ai(ui)Q!2(u2)Q!3(u3) = o{{NaN)^^'^)- 

For the /c — 3 other terms, a result corresponding to Lemma 1 of Giraitis (1985) can also be obtained. Indeed, 
for a and N large enough. 



ffJVj||2 = / dxD%{xi^ ^ 2:n(j)) H 

J[-7r,7r]"0) -^^ 

/ dxD%{-{xi-\ ^a;„(j))TT 



< c 



< ^l-p{^}r«ll^-(-)lll 

xeR FI 

< C Nun 

with C" > not depending on and un- Thus ||5iv.j||2 < C {Nafq)^/"^ with C > 0. Using the same 
reasomng, there also exists C" > such as Wg'^^h < C {Nun)^^'^ for j >2 while ||5^_il|2 = 0(^0]^ logiV) = 
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o{{NaNY^'^)- a consequence, for 7 such as \Vi nLj\> 1 for at least 3 different rows Lj, and more generally 
with \Vi \ > 3, 

/^(iV) = o((Araw)'=/2). (34) 

For other 7, it remains to bound the function h(ui,U2) defined in Giraitis (1985, p. 32) as follows (with 
X = .Til + •"2^12) and with iti + 1*2 7^ 0: 



h{ui,U2) = \Pai^i-x) DNiui + x) DNiu2 - x)\dx^ \'^aNix)\^dx^ 



< 









J — TT 



\^\2d 



dx 



But 



\D IS! {ui + x) D IS! {u2 — x)\dx < 2 



2ttN 



2txN 



sin(x) sin(^(iii + 1*2) - a;) 



X sin(i(ui +U2) - 4) 



dx 



< 



Clog7V|sin(i(ui+U2))| ' if |ui +U2I > (iVlogAT)-! 
CN if jui +U2I < (AriogA^)^i 



Therefore, 

\\h{ui,U2)\\ 



I h'^{ui,U2)duidu2 < Caljflog^N [ 

./[-ir,7rl2 ^ J(N 



{smxy"^ dx + 



(TV log W)-! 



(NlogN)- 



dx 



< Ca%{N\og^N + NlogN), 
and hence ||/i(ui, U2)||2 = o{NaN)- Finally, (p4| holds for all 7 and it implies ([32]) . 

If > 1, the same proof can be repeated from the linearity properties of cumulants. Thus, {T^iri aN))i<i<e 
satisfies the following central limit: 



^ (TAr(r, OTv) - l)i<j<^ A/'(0, r(ri, • • • ,ri,'ip,d)) , 



(35) 



with r(ri, 



,'ip,d) = {%j)i<i,j<£ given in ©. 



Step 3 Now we extend the central limit obtained in Step 2 for linear processes with an innovation distri- 
bution satisfying a Cramer condition (E(e''^°) < 00) to the weaker condition E^q < 00 using a truncation 
procedure. Thus assume now that E^q < 00. Let M > and define = CtI|c|<J\/ a-^d = CtI|C|>J\/7 
e"(a, 6) = J2s£Z /3a(&-s)^7 and £+(0,6) = I^sgz /^a(&-s)^^ using ([33l). Clearly e (a, 6) = e+(a, 6) + e"(a, 5). 
We are going to prove that ([55]) holds. For this, we begin by writing 

Tjv(na7v)-1 = ]\f ( (e~(riajv,6))^ - 1) - 2e+(riaAr,6)e"(r,aAr,6) + (e+(ria7v,&))^) (36) 



6=1 



We first prove that (r^(riajv) - l)i<,<^ = (ivd^ EbL/"''" (e (r,;aAr,6))^ - l);^<^<^ also satisfies 
Indeed, {e~ (riUN, b)) is a linear process with innovations (^j") satisfying the Cramer condition and it is obvious 

^^^^ /E(.(..a„,.))- y/^^ 



e (riflAT, fe)f,^i has exactly the same distribution than eijiaN, b)b.i- Therefore it remains 



to prove that 



and E^o = 1 (from Property [T|) . Then 



E(e- (riaN,6))2 

-operty[T|). T 

E(e-(r,aw,&))2 



E(e(r,aAr,&))2 



1) converges to 0. We have ¥.{e{r^aN ,h)f = (E^ez = 1 

< 2 (E(e+(r,aw, 6))^) + E(e+(r,ajv, 6))'. 



We have E(e+(riaAr, 6))^ = ( Esez /^a('S)) ^l^CCcr)^ IE(^(j')^ from previous arguments and since we assume that 
the distribution of ^0 is symmetric. But using Holder's and Markov's inequalities E(^+)2 < (E^;^)i/2(Pi-(|^g| > 
M))i/2 < (]E^4^ M~'^. Hence, there exists C > not depending on M and N, 



E(e-(r,aw,&))2 



E e(r,ajv,6))2 



- 1 



C 

M N- 
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when M = N (for instance). Therefore (T^{ri qn) — l)i<j<^ satisfies the CLT ([55)1 . 
From p6p . it remains to prove that 



\ — -T-j ( -2e+(r,;aAr,6)e"(r,;ajv,6) + (e+(riaAr,6)) ) — 0. 



From Markov's and Holder inequalities, this is implied when y ^ (E(e+(riaAr, &)) +2y E(e+(riajv, &)) ) — > 

with E(er+(riaAr, &)) = 1. Using E(e+(r,aAr, < (E^^) Af^^ obtained above, we deduce that this statement 
holds when M — N (for instance). As a consequence, from the CLT holds even if the distribution 
of ^0 is only symmetric and such that E^q < oo. 

Step 4 It remains to apply the Delta-method to ([35|) with function (xi, • • • , xe) M- (logxi, • • • , logx^): 

— (log(Tjv(r,ajv))-log(Ee2(aAr,l)))^<^<^ AA(0 , r(ri, • • • ,n,^,d)), 

With Ee^(a7v, 1) provided in Property [TJ we obtain 

logEeHa^, 1) = 2d log(aA.) + log (ff^M) + ^'^'^^^^-^0 (i + 

/ iV 1 1 

Therefore, when W ^ — > 0, i.e. A^^r+^ = o(aAr), the CLT (gl) holds. □ 

V fljv N^oo 

Proof of Theorem\^ Here we use Theorem 1 of Bardet et al. (2008) where it was proved that the CLT ^ is 
still valid when is replaced by iV"" . Then, sincedjv = YAr(5Ar) with Mat = (O 1/2) (Z(r^^Zi) "^Z(r^^ 



we deduce that N /N"-'^ {djsi ~d}j is asymptotically Gaussian with asymptotic variance the limit in probability 
of MArr(l,...,f,d,V')M^, that is cr^ 

The right hand side relation of is also an obvious consequence of Theorem 1 of Bardet et al. (2008). □ 
Proof of Theorem\^ The theory of linear models can be applied: Zj^a,^ ( r,7 ) is an orthogonal projector 

of Y/v(ajv) on a subspace of dimension 2, therefore YN(aN) — Zj^a^ [ ~ ) is an orthogonal projector of 

Ym{olm) on a subspace of dimension ^ — 2. Moreover, using the CLT where ajy is replaced by A^"™, we 
deduce that yj N/N"'^ T~j^YN{aN) asymptotically follows a Gaussian distribution with asymptotic covariance 
matrix 1^ (identity matrix). Hence form Cochran Theorem we deduce (|15p. □ 
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